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ABSTRACT 


The main aim of this paper is to provide a deep analysis on the 
research field of healthcare data analytics., as well as highlighting 
some of guidelines and gaps in previous studies. This study has 
focused on searching relevant papers about healthcare analytics by 
searching in seven popular databases such as google scholar and 
springer using specific keywords, in order to understand the 
healthcare topic and conduct our literature review. The paper has 
listed some data analytics tools and techniques that have been used to 
improve healthcare performance in many areas such as: medical 
operations, reports, decision making, and prediction and prevention 
system. Moreover, the systematic review has showed an interesting 
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demographic of fields of publication, research approaches, as well as 


outlined some of the possible reasons and issues associated with 
healthcare data analytics, based on geographical distribution theme. 
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1. INTRODUCTION 

The rapid development in technology facilitated the 
healthcare system to generate huge amounts of 
information related to the patient followed by their 
records pertaining to disease, treatment history, test 
reports, etc. 


The concern towards healthcare is increasing day by 
day with the rapid increase in population. 
Traditionally data is available in the form of hard 
copies, but due to technology development and in the 
era of digital world everything is collected and stored 
in the digitized form. It can be predicted that in the 
future there will be significant growth in the health 
data. The other organization like insurance 
companies, government organizations are using this 
data to provide health benefits to the patients. 
Analysis of disease patterns, tracking the disease can 
help us to find a cure for the disease, and dealing with 
disease eruption can eradicate the chances of its 
growth. Such analysis can enhance the level of public 
health and awareness and rapid actions to control the 
diseases. Development of required vaccines by the 
medical researchers. Converting large amounts of 
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health information into predictive models to 
recognize the needs, the services to be provided, 
predict, and prevent a health disaster for the benefit of 
people. Improvement in operational functionality can 
also add light to the healthcare system. For instance, 
by investigating patient history, frequency of patient 
visits, and analyzing staff efficiency, healthcare 
facilities can optimally allocate healthcare staff to a 
particular shift without having to overstaff or under 
staff. Predictive analytics is vital to achieving the goal 
of providing better care and cutting down on 
healthcare costs simultaneously. 


With the use of disease predictive modeling, virtual 
assistance can also be provided to patients. Based on 
the confidence rate, with very basic information input 
about symptoms patients can get insights about the 
various possible diseases. Drug discovery is very time 
and cost consuming. Mutation profiles and patient 
metadata can be used by various pharmaceutical 
companies for the discovery of new drugs using 
Machine learning Algorithms. One of the critical 
elements in the examination and figuring out the 
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treatment strategy is medical image analysis. 
Mammography, X-ray, MRI, and others provide vital 
insights into the patient’s disease. Accuracy of the 
images and its interpretation needs to be top-notch in 
medical Image analysis. Supervised machine learning 
algorithms can help in medical image analysis by 
improving its Image resolution, modality difference, 
and the dimension of images. 


2. BACKGROUND 

Data is an imperative aspect of every industry and 
organization. Due to the internet, around 2TB of data 
is being generated by the users every day. This data 
includes several types of forms that may be either 
structured or unstructured. 


In the earlier, the Doctors were not able to monitor 
the patients’ health condition in real time. As disease 
increases, the cost to cure it also increases, so to cure 
the extreme diseases was very expensive. 


Nowadays, with the help of Data Science and 
Machine Learning applications, Doctors can get real- 
time information about the patients’ health condition 
through their wearable devices. Based on_ this 
information, the Doctors can send the junior doctors, 
nurses, or assistants to the patient’s home. 


Not only the Doctors can provide immediate 
assistance and treatment to the patients, but they can 
also install several required equipment and devices 
for the patient’s diagnosis. These devices can collect 
heart rate, body temperature, blood pressure, etc., 
information through updates and notification of 
mobile applications on the top of Data Science. 


3. METHODS 

The objective of this paper was to conduct a review, 
which encourages professionals, doctors, medical 
staff and patients to adopt and utilize technologies in 
order to assist healthcare analytics and improve 
decision making process in our everyday life. 


Our method has followed three steps: 1) searching for 
initial and related studies, 2) Relevance appraisal and 
evaluation, and finally extracting data. The next 
sections will explain these steps briefly. 


Searching for initial and related studies: the first step 
in order to find the articles was to specify and identify 
main keywords (Dieste et al. 2009). A survey was 
conducted to study relevant papers published since 
2010 in the information system field in general and 
healthcare analytics and medical decision support 
system in specific. This study has found that most 


relevant keywords to “healthcare analytics” and “data 
mining” used with technology to support medical 
information systems. 


The following searching phrases were used and 
structured in searching for relevant papers in many 
different databases — 1.e. the relevant and related 
papers should contain in its titles, keywords, abstract 
or full text the word “healthcare” along with any of 
“analytics”, “metrics”, “data mining”, “big data” or 
“decision making”. see table 1 


Group 1 Group 2 


E-Healthcare Analytics 
Medical Practice Metrics 
Health Decision Making 
Clinical Prediction 
Hospitals Big Data 
Care systems Data Mining 
Wellness Business 
programs Intelligence 


Table: Main keyword 


Initial List of Papers 


N=73,542 


Remains Papers after 
Filtering by Title 


Remains Papers after 
Filtering by Abstract 


N= 468 N=165 


Relevant Papers 
——> 
N=81 


Fig.1 Search Process 


As a result, searching techniques relied on choosing 
any key word from the (group A) and linked it with 
any word of (group B) to form searching keywords 
statement, such as (“E-Healthcare” OR “Medical 
Practice” OR “Health” OR “Clinical” OR “Hospitals” 
OR “Care Systems” OR “Wellness Programs”) AND 
“Analytics” OR “Metrics” OR “Decision Making” 
OR “Prediction” OR “Big Data” OR “Data Mining” 
OR “Business Intelligence”. 


Once the keywords were identified, 7 online 
databases were searched to find the initial list of the 
studies. In the search, titles, keywords, abstract and 
full text were considered and the search was limited 
to studies published since 2010, inclusive. 


The databases were searched over multiple subjects 
and returned total of 73,542 articles (see Figure 1). 
This study found some of the papers are indexed by 
multiple databases. As it shown in Table 2, total 
number of the papers after deducting the repeated 
papers was 81. 
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Papers Filtered By 


Papers Filtered By Papers Filtered 


Abstract By Text 
Number Numberof Number Number 
offound repeated of found of 

articles articles articles repeated 
articles 

47 11 25 9 
39 5 20 

21 4 7 2 
44 5 31 4 

29 3 14 

15 4 6 1 

7 5 3 1 

165 81 


Table 2: Initial list of studies 


Title 
Name of Initial Number Number of 
Database list of offound repeated 
papers articles articles 
Google 37,000 72 11 
Scholar 
IEEExplor 6,791 103 5 
ACM Digital 2041 49 
Library 
ProQuest 16,139 133 8 
Scopus 1,380 104 3 
(Elsevier) 
Springer 8568 43 9 
Wiley 1623 11 5 
InterScience| 
Total 73,542 468 
4. RESULTS 


4.1. Fields of Publication 

This systematic review has found Information 
Systems with 43 papers and Healthcare with 31 
papers as most active communities in the research 
related to the healthcare analytics topic, however 7 
papers were published related to the healthcare 
analytics in Computer Science. Figure 3 shows that 
authors focused on information systems and 
healthcare fields more than computer Science. 


As it shown in Figure 2, academic papers related to 
the healthcare analytics and decision making were 
mostly published in information systems and 
healthcare for the reason that most studies recently 
have focused on improving healthcare analytics using 
data mining and business intelligence techniques, 
however a few were published in the field of 
computer science. This could be because of computer 
scientist were dealing with old traditional methods 
and trying to solve general issues using these methods 
rather than suggesting new techniques due to the 
evolution of technology in these days, as well as most 
of articles were trying to use old methods and 
involving only doctors and professionals in healthcare 
analytics process in order to improve clinical and 
hospitals performance as an interest topic in computer 
science, paying no attention for the importance to 
involve patients in that. 


4.2. Application Areas 

This paper found that healthcare analytics papers has 
focused on six main areas: (a) healthcare decision 
making (b) predictions of diseases & patient sickness 
preventions (c) clinical delivery (d) clinical 
operations, performance monitoring & reporting (e) 
Improved diagnoses, treatment and results and finally 
(f) Healthcare information exchange. However, there 
was some studies focused on financial and supply 
chain management, but due to the small number of 
publications in these areas, we removed them from 
this Figure. 


As it shown in Figure 3 most attention was paid to 
improve healthcare analytics performance and results, 
therefore the Figure below illustrates that most 
publications focused healthcare prediction and 
preventions, decision making process and healthcare 
treatment and monitoring. 


For many researchers, the main factors of reaching 
high level of healthcare analytics were in simplifying 
unstructured clinical records, as well as capturing 
patient’s behavior and encourage individuals to 
educate themselves, as well as keep following ups, by 
adopting technologies and internet services and 
applications, for example social networking sites & 
social media which may allow them to keep updated 
and connected with other patients worldwide, sharing 
their health information and supporting each other in 
order to improve healthcare analytics and reduce 
costs. 
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Information Systems 


Healthcare 


Computer Science 
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Number of Papers 


rT 
Figure 2: Fields of Publications 


Healthcare information & 
diagnoses share 
Improved diagnoses, 
treatment and results 
Clinical operations, 


performance monitoring & 


Clinical delivery 
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Figure 3: Application Areas 


ADVANTAGES 
Support clinical treatment decisions 
physicians and other health professionals. 


vu 


from 


> Improve the accuracy and speed of identifying 
patients at highest risk of disease. 


> Provide greater detail in the EHRs of individual 
patients. 


> Make the provision of healthcare more efficient, 
which reduces costs. 


> Promote preventive meas 


> ures by giving patients greater insight into their 
health and treatment goals. 


> Integrate data from consumer fitness devices and 
other patient-provided sources of health data. 


> Deliver real-time alerts to healthcare providers by 
analyzing health data at the collection point. 


6. FUTURE SCOPE 

There have been many improvements done in the 
healthcare sector, but still, some more applications 
and improvements are required in the future like: 


> Digitalization 

> Technological Inclusion 

> Reduced Cost of Treatment 
> 


Need to be able to handle huge amount of 
patient’s information 


Data science tools and technologies are working for 
these requirements and have made many 
improvements as well. Data science is doing wonders 
in many real-life areas and contributing a lot. There 
will be much assistance available for doctors and 
patients through this revolution of data science in the 
future. 


7. CONCLUSION 

Finally, we conclude that data science has many 
applications in healthcare. The healthcare industry is 
heavily dependent on Data Science for its 
improvement. Additionally, with the development 
and advancement of medical image analysis, it will be 
possible for physicians to find microscopic tumors 
that are in fact difficult to find manually. Therefore, 
data science has revolutionized healthcare and the 
large-scale medical industry. 
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